Modelling duration for different text materials
نویسندگان
چکیده
Rules for segmental duration has been studied in the context of a speech database that is under development in our department. The database search procedures include the same kind of context sensitive rules that are used in our speech synthesis project. This gives us the possibility to make a direct comparison to the database durations when developing durational rules for a text-to-speech system. Different kinds of speech material have been studied, induding a novel and read sentences. Some different descriptive frameworks have been tried. A modified version of a rule structure suggested by Klatt has proven to be especially useful.
منابع مشابه
Speech Timing in Slovenian Tts
Speech timing at different speaking rates was studied for the Slovenian language and the results were applied for duration modelling in the Slovenian text-to-speech system S5 [1]. In order to enable the synthesiser to pronounce input text with several speaking rates, tests were made to study the impact of speaking rate on syllable duration and duration of individual phonemes and phoneme groups ...
متن کاملSpeech Timing in Slovenian Tts
Speech timing at different speaking rates was studied for the Slovenian language and the results were applied for duration modelling in the Slovenian text-to-speech system S5 [1]. In order to enable the synthesiser to pronounce input text with several speaking rates, tests were made to study the impact of speaking rate on syllable duration and duration of individual phonemes and phoneme groups ...
متن کاملAutomatic Parameters Estimation of the D. Klatt Phoneme Duration Model
Phoneme duration modelling is one of the stages in prosody modelling for text-to-speech systems. The rule-based phoneme duration model proposed by Klatt (1979) is still quite a popular method. One of themain shortcomings of thismethod is that the values of the parameters are selected in an experimental way. This work proposes a new iterative algorithm for the automatic estimation of the factors...
متن کاملProsody modelling in Czech text-to-speech synthesis
This paper describes data-driven modelling of all three basic prosodic features – fundamental frequency, intensity and segmental duration – in the Czech text-to-speech system ARTIC. The fundamental frequency is generated by a model based on concatenation of automatically acquired intonational patterns. Intensity of synthesised speech is modelled by experimentally created rules which are in conf...
متن کاملSegmental duration modelling in a text-to-speech system for the galician language
In this contribution we propose a segmental duration model for the Galician language. We have focused our work on the study of allophonic durations in their syllabic environment. Firstly, a study of the speech rate over a recorded corpus led us to consider different behaviours in certain types of sentences. Secondly, the corpus was analyzed in order to determine the main factors affecting durat...
متن کامل